NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations

Rout, Litu; Chen, Yujia; Ruiz, Nataniel; Caramanis, Constantine; Shakkottai, Sanjay; Chu, Wen-Sheng (May 2025, ICLR 2025)

Generative models transform random noise into images, while their inversion aims to reconstruct structured noise for recovery and editing. This paper addresses two key tasks: (i) inversion and (ii) editing of real images using stochastic equivalents of rectified flow models (e.g., Flux). While Diffusion Models (DMs) dominate the field of generative modeling for images, their inversion suffers from faithfulness and editability challenges due to nonlinear drift and diffusion. Existing DM inversion methods require costly training of additional parameters or test-time optimization of latent variables. Rectified Flows (RFs) offer a promising alternative to DMs, yet their inversion remains underexplored. We propose RF inversion using dynamic optimal control derived via a linear quadratic regulator, and prove that the resulting vector field is equivalent to a rectified stochastic differential equation. We further extend our framework to design a stochastic sampler for Flux. Our method achieves state-of-the-art performance in zero-shot inversion and editing, surpassing prior works in stroke-to-image synthesis and semantic image editing, with large-scale human evaluations confirming user preference. See our project page https://rf-inversion.github.io/ for code and demo.
more » « less
Free, publicly-accessible full text available May 1, 2026
Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models

Raoof, Negin; Rout, Litu; Daras, Giannis; Sanghavi, Sujay; Caramanis, Constantine; Shakkottai, Sanjay; Dimakis, Alex (March 2025, ICLR 2025)

In pretraining data detection, the goal is to detect whether a given sentence is in the dataset used for training a Large Language Model LLM). Recent methods (such as Min-K % and Min-K%++) reveal that most training corpora are likely contaminated with both sensitive content and evaluation benchmarks, leading to inflated test set performance. These methods sometimes fail to detect samples from the pretraining data, primarily because they depend on statistics composed of causal token likelihoods. We introduce Infilling Score, a new test-statistic based on non-causal token likelihoods. Infilling Score can be computed for autoregressive models without re-training using Bayes rule. A naive application of Bayes rule scales linearly with the vocabulary size. However, we propose a ratio test-statistic whose computation is invariant to vocabulary size. Empirically, our method achieves a significant accuracy gain over state-of-the-art methods including Min-K%, and Min-K%++ on the WikiMIA benchmark across seven models with different parameter sizes. Further, we achieve higher AUC compared to reference-free methods on the challenging MIMIR benchmark. Finally, we create a benchmark dataset consisting of recent data sources published after the release of Llama-3; this benchmark provides a statistical baseline to indicate potential corpora used for Llama-3 training.
more » « less
Free, publicly-accessible full text available March 26, 2026
Global Optimality of the EM Algorithm for Mixtures of Two-Component Linear Regressions

https://doi.org/10.1109/TIT.2024.3435522

Kwon, Jeongyeol; Qian, Wei; Chen, Yudong; Caramanis, Constantine; Davis, Damek; Ho, Nhat (September 2024, IEEE Transactions on Information Theory)

Full Text Available
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control

Rout, Litu; Chen, Yujia; Ruiz, Nataniel; Kumar, Abhishek; Caramanis, Constantine; Shakkottai, Sanjay; Chu, Wen-Sheng (May 2024, https://doi.org/10.48550/arXiv.2405.17401)

The authors propose Reference-Based Modulation (RB-Modulation), a plug-and-play, training-free solution for personalization of diffusion models. Existing training-free methods face challenges in (a) extracting style from reference images without additional style or content text descriptions, (b) avoiding unwanted content leakage from style references, and (c) composing style and content effectively. RB-Modulation addresses these issues using a novel stochastic optimal controller, where a style descriptor encodes the desired attributes through a terminal cost. The induced drift ensures high fidelity to the reference style while adhering to the text prompt. Additionally, the authors introduce a cross-attention-based feature aggregation scheme that decouples content and style from the reference image. With both theoretical justification and empirical validation, RB-Modulation demonstrates precise control of content and style in a training-free manner, while enabling seamless composition—eliminating reliance on external adapters or ControlNets.
more » « less
Full Text Available
Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods

Caramanis, Constantine; Fotakis, Dimitris; Kalavasis, Alkis; Kontonis, Vasilis; Tzamos, Christos (December 2023, 37th Annual Conference on Neural Information Processing Systems, 2023)

Full Text Available
Learning To Maximize Welfare with a Reusable Resource

https://doi.org/10.1145/3530893

Faw, Matthew; Papadigenopoulos, Orestis; Caramanis, Constantine; Shakkottai, Sanjay (May 2022, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Considerable work has focused on optimal stopping problems where random IID offers arrive sequentially for a single available resource which is controlled by the decision-maker. After viewing the realization of the offer, the decision-maker irrevocably rejects it, or accepts it, collecting the reward and ending the game. We consider an important extension of this model to a dynamic setting where the resource is "renewable'' (a rental, a work assignment, or a temporary position) and can be allocated again after a delay period d. In the case where the reward distribution is known a priori, we design an (asymptotically optimal) 1/2-competitive Prophet Inequality, namely, a policy that collects in expectation at least half of the expected reward collected by a prophet who a priori knows all the realizations. This policy has a particularly simple characterization as a thresholding rule which depends on the reward distribution and the blocking period d, and arises naturally from an LP-relaxation of the prophet's optimal solution. Moreover, it gives the key for extending to the case of unknown distributions; here, we construct a dynamic threshold rule using the reward samples collected when the resource is not blocked. We provide a regret guarantee for our algorithm against the best policy in hindsight, and prove a complementing minimax lower bound on the best achievable regret, establishing that our policy achieves, up to poly-logarithmic factors, the best possible regret in this setting.
more » « less
Full Text Available
Asymptotically-Optimal Gaussian Bandits with Side Observations

Atsidakou, Alexia; Papadigenopoulos, Orestis; Caramanis, Constantine; Sanghavi, Sujay; Shakkottai, Sanjay (January 2022, Proceedings of the 39th International Conference on Machine Learning)

Full Text Available
The EM Algorithm gives Sample Optimality for Learning Mixtures of Well-Separated Gaussians

Kwon, Jeongyeol; Caramanis, Constantine (July 2020, Proceedings of Thirty Third Conference on Learning Theory)

We consider the problem of spherical Gaussian Mixture models with 𝑘≥3 components when the components are well separated. A fundamental previous result established that separation of Ω(log𝑘‾‾‾‾‾√) is necessary and sufficient for identifiability of the parameters with \textit{polynomial} sample complexity (Regev and Vijayaraghavan, 2017). In the same context, we show that 𝑂̃ (𝑘𝑑/𝜖2) samples suffice for any 𝜖≲1/𝑘, closing the gap from polynomial to linear, and thus giving the first optimal sample upper bound for the parameter estimation of well-separated Gaussian mixtures. We accomplish this by proving a new result for the Expectation-Maximization (EM) algorithm: we show that EM converges locally, under separation Ω(log𝑘‾‾‾‾‾√). The previous best-known guarantee required Ω(𝑘‾‾√) separation (Yan, et al., 2017). Unlike prior work, our results do not assume or use prior knowledge of the (potentially different) mixing weights or variances of the Gaussian components. Furthermore, our results show that the finite-sample error of EM does not depend on non-universal quantities such as pairwise distances between means of Gaussian components.
more » « less
Full Text Available
Combinatorial Blocking Bandits with Stochastic Delays

Atsidakou, Alexia; Papadigenopoulos, Orestis; Basu, Soumya; Caramanis, Constantine; Shakkottai, Sanjay (January 2021, Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021.)
null (Ed.)
Full Text Available
EM Converges for a Mixture of Many Linear Regressions

Kwon, Jeongyeol; Caramanis, Constantine (January 2020, Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics)

Full Text Available

« Prev Next »

Search for: All records